1.  INTRODUCTION


This paper describes a study aimed at segmenting a high resolution black
and white (B/W) digital image of Sunnyvale, California. This scene
contained a total of 47 of the Defense Mapping Agency's Mapping, Charting
and Geodesy (M, C & G) tangible features. The objective of the study was to
segment the scene into regions which correspond to as many of these 47
M, C & G features as possible. Figure 1 shows a facsimile of the image.
Table 1 shows the 9 land use classes considered in this study and their
correspondence to the M, C & G features. Clearly the classes chosen do not
give as detailed an image segmentation as required. The rationale for the
choice of these classes is given in Section 4. Ways of achieving a more
detailed segmentation using the methods presented in this paper are
described in Section 3.

The time constraints of the study necessitated that readily available
techniques be employed. Yet it was desired that the methods used be as
general as possible. For these reasons texture analysis methods were
utilized. In particular, the spatial gray level dependence method (SGLDM)
was selected because two comparison studies [1,2] have shown it to be
superior, real world studies have demonstrated its capabilities
[3,4,5,6,7,8,9], and perceptual psychology studies [10,11] have shown it
to match a level of human perception.

In selecting the segmentation procedure the desire for generality
led to the consideration of split, merge, and split and merge types of
procedures [12,13]. A split type approach was the one selected. This
procedure used texture measures extracted from a region R to determine
whether or not R is composed entirely of one of K known classes. If R
contains one of these classes then it is appropriately labeled. If not,
then R is split. The inferences regarding R are based on a series of
hypothesis tests. As such the procedure is related to the uniformity
predicate of Pavlidis [12].

CLASS NAME                     M,C & G TANGIBLE FEATURE NAME

1. Residential Area            Houses (Single Family)
                               Apartment/Hotel*

2. Mobile Home Area            Mobile Homes

3. Vehicle Parking Area        Vehicle Parking Area
                               Vehicle Storage/Motor Pool

4. Aircraft Parking Area       Aircraft Parking Area/Apron

5. Runway                      Runway/Taxiway
                               Heliport

6. Water                       Salt Pan/Evaporators/
                               Lake/Pond Reservoir

7. Dry Land                    Mineral Pile
                               Dry Land (Bare/Barren Soil/Non-Cultivated)
                               Levee/Embankment/Fill
                               Crop (Cultivated)
                               Deciduous Woodland

8. Multilane Highway           Multilane, Divided Highway (Grass Median)
                               Multilane Highway
                               Cloverleaf/Interchange

9. Commercial/Industrial       Fabrication Industry Building
                               Scrap Yard
                               Industrial Building
                               Industrial Conveyor
                               Industrial Rotating Crane
                               Commercial Building
                               Apartment/Hotel*
                               Barracks
                               Governmental Administration Bldg.
                               Military Admin/Operations Bldg.
                               School Building
                               RR Station/Depot
                               Airport/Airbase Control Tower
                               Hangar
                               Aerospace Assembly Building
                               Engine Test Cell
                               Wind Tunnel
                               Warehouse
                               Greenhouse
                               Drive-In Theater Screen

Table 1.  A list of nine classes and the M,C & G tangible features
          combined together to compose each class. Note that a region
          containing the M,C & G Apartment/Hotel feature (*) is placed in
          one of two classes based upon the appearance of this region.

Experiments were performed which indicate the capabilities of the
segmentation procedure. The purpose of one of the experiments was to
determine whether texture measures could be used to accurately classify
a region into urban land use classes. For this study the nine classes
in Table 1 were used. The capability of the texture measures is evidenced
by a training result of 90% overall correct classification. A worst case
segmentation of the scene was conducted with an 83% overall correct
classification. Other experiments were performed to evaluate the mechanisms
used in making the decision to split. One dealt with demonstrating the
ability to identify regions containing one or more "unknown" or
"unspecified" classes. Another dealt with identifying regions containing
two or more "known" classes, i.e., "boundary" regions.

These  experiments  indicate  the  proposed  segmentation  procedure  is 
feasible  and  could  be  useful  in  segmenting  high  resolution  urban  scenes. 
Further  its  generality  is  such  that  it  would  seemingly  be  applicable  to 
a  variety  of  problems. 




2.  TEXTURE  ANALYSIS  METHODOLOGIES 

2.1  Texture Analysis and Land Use Classification

Many  different  texture  analysis  operators  have  been  applied  to  the 
land  use  classification  problem.  In  this  section  a  brief  review  of  the 
literature  will  be  given  in  order  to  indicate  the  texture  analysis  methods 
which  have  been  employed  and  the  results  each  has  yielded.  Generally 
speaking,  texture  operators  have  been  used  successfully  on  a  variety 
of  land  use  problems. 

One of the first attempts to use texture analysis methods for land use
classification was made by Lendaris and Stanley [14]. They employed the
power spectral method (PSM) to analyze a high resolution B/W aerial
photograph. The two classes considered were gross manmade and nothing
manmade. A 98.8% correct classification was obtained in detecting natural
areas.

Galloway [15] used the gray level run length method (GLRLM) to
classify 54 high resolution B/W aerial photographs. The classes considered
were orchard, wood, urban, suburb, lake, marsh, swamp, railroad and scrub.
The percentage of correct classification obtained for each class ranged
from 83% to 100%.

Hsu [16] applied a different texture algorithm in the analysis of
7 B/W aerial photographs. This method involved the calculation of 17
texture measures from 3 x 3 or 5 x 5 windows. The classes considered were
vegetation, soil, pavement, composite field 1 and 2, and composite. The
best probability of correct classification obtained was 84.3%.

Mitchell and Carlton [17] made use of the max-min method to find roads,
grasslands and forests in B/W aerial photographs. Basically, the max-min
method  involves  locating  local  gray  level  extrema  along  a  scan  line  of 
the  picture.  The  number  of  extrema  of  strength  T  or  greater  is  a  measure 
used  in  making  the  classifications. 
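
The counting step described above can be illustrated with a minimal sketch.
The text does not define the "strength" of an extremum, so the sketch
assumes it is the gray level swing between an extremum and the value that
confirms it; the function name `count_extrema` is hypothetical and is not
taken from [17].

```python
def count_extrema(scan_line, threshold):
    """Count local gray level extrema of strength >= threshold on a scan line.

    Assumption: an extremum is accepted once the signal has moved at least
    `threshold` gray levels away from it in the opposite direction.
    """
    if not scan_line:
        return 0
    count = 0
    cur_min = cur_max = scan_line[0]
    looking_for_max = True
    for g in scan_line[1:]:
        cur_max = max(cur_max, g)
        cur_min = min(cur_min, g)
        if looking_for_max:
            # The signal dropped far enough: cur_max was a maximum.
            if cur_max - g >= threshold:
                count += 1
                cur_min = g
                looking_for_max = False
        else:
            # The signal rose far enough: cur_min was a minimum.
            if g - cur_min >= threshold:
                count += 1
                cur_max = g
                looking_for_max = True
    return count
```

Small fluctuations below the threshold are ignored, so a noisy but smooth
scan line yields a low count while a busy texture yields a high one.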

A study using the SGLDM was performed by Haralick et al. [6]. The
data source for the study was 1:20,000 scale B/W aerial photographs. The
classes considered were old residential, new residential, lake, swamp,
marsh, urban, rail and woodlands. A total of 170 samples were considered.
The study yielded an 82.3% overall correct classification.

The above studies all involved reasonably high resolution B/W
photography. However, texture analysis has proved useful in much coarser
resolution imagery such as the 1.1 acre ground resolution data of Landsat.
The texture operators applied to these data were used in conjunction with
the multispectral information to yield improved classification accuracies
[4,18]. Texture operators have also proved useful on higher resolution
multispectral scanner (MSS) data such as aircraft MSS [19,20,21].

The  studies  of  particular  interest  are  those  involving  high 
resolution  B/W  imagery.  These  studies  can  be  categorized  as  those  where 
selected  samples  were  classified  and  those  where  a  segmentation  of  a  scene 
was  attempted.  Haralick  et  al.  [4]  and  Galloway  [15]  both  classified 
pure  samples  from  a  selected  number  of  classes.  Neither  involved  the 
consideration  of  regions  composed  of  combinations  of  two  or  more  classes. 

On the other hand, Lendaris and Stanley [14], Hsu [16], and Mitchell and
Carlton [17] all attempted a segmentation. Consequently, they had to
classify regions composed of combinations of classes. However, the number
and types of classes considered in these studies do not seem suitable for
urban scene analysis. Also, these studies had no provision for considering
more than one fixed primitive region size, i.e., no splitting or merging
type procedure was attempted.

2.2  The SGLDM Algorithm

To defend the selection of the SGLDM, recall that this algorithm has
proven useful on a variety of real world problems ranging from the analysis
of human radiographs to land use classifications [1,2,3,4,5,6,7,8,13,18,
19,22]. (A complete survey of texture analysis is presented in reference
23.) Next, two comparison studies have shown the SGLDM to be a superior
algorithm. One of these studies [8] compared the relative merits of four
algorithms, SGLDM, GLRLM, PSM and the gray level difference method (GLDM),
for terrain type classification. The SGLDM texture measures gave the best
overall classification accuracy. The other comparison study [9] evaluated
the amount of texture-context information contained in the intermediate
matrices of these same four algorithms. Here again the cooccurrence
matrices used by the SGLDM algorithm were judged to be the best. Further,
it is worth noting that Mitchell et al. [17] made a preliminary comparison
of the max-min method to the SGLDM. The comparison was based on the
probability of correct classification obtained by applying both the
max-min method and the SGLDM (with the energy, entropy, correlation, local
homogeneity, and inertia as texture measures) to a set of texture data. The
results indicated that the two methods performed about equally well. This
comparison together with the fact that the max-min method is
computationally less complex than the SGLDM would seemingly make it a
desirable alternative. However, it is believed that the max-min method is
an innately weaker algorithm. This belief is based upon the many simple
textures which cannot be discriminated by the max-min algorithm. For
example, Figure 2 shows two visually distinct texture pairs neither of
which can be discriminated by the max-min algorithm.

Hsu's method [16], while not having been directly compared to the
SGLDM, seems appropriate only for a bottom-up approach to segmentation.
Such approaches require the incorporation of world knowledge into the
analysis process in order to identify structures such as commercial
buildings, parking lots, homes, etc. Incorporating such information has
proved to be a difficult task, one which is not generally well understood.

Finally, the Julesz conjecture [10,11] supports the use of cooccurrence
matrices. It should be noted that recently a number of counterexamples
to the Julesz conjecture have been reported [24,25,26,27,28]. However,
[29] indicates that these counterexamples can be discriminated. Hence,
at present there is no known example of a visually distinct texture pair
which cannot be discriminated by the cooccurrence matrices.

2.3  Statistical and Structural Texture Analysis

The desire has been to use cooccurrence matrices not only to
discriminate texture patterns but also to be able to use them to
characterize the structure in textures. In response to this motivation a
model for texture was formulated based on mathematical tiling theory [30].
Later it will be shown that this model applies to the urban land use data.
The system which is being developed to measure image structure using the
cooccurrence matrices is called the SSA (statistical structural analysis
system).

Definition 1: A tile T is a closed topological disk.

Definition 2: A function $\sigma: E^2 \to E^2$ is called an isometry or
congruence transformation if it maps the Euclidean plane onto itself and
if the function preserves distance. That is, if $x$ and $y$ are points in
$E^2$ then $\|x - y\| = \|\sigma(x) - \sigma(y)\|$.

A cooccurrence matrix $S(\delta,T) = [s(i,j,\delta,T)]$ is a matrix of
estimated second-order probabilities where each element $s(i,j,\delta,T)$
is the estimated probability of going from gray level $i$ to gray level $j$
given the displacement vector $\delta = (\Delta x, \Delta y)$ and $T$, the
region size and shape used to estimate the probabilities. In this context
$T$ is a tile such that $s(i,j,\delta,T)$ is estimated from the restriction
of the picture function $g(x)$ to $\sigma(T)$ where $\sigma$ is a
translation isometry. Computationally $S(\delta,T)$ is determined using
the equation


5.  Energy

$$E(\delta,T) = \sum_{i=0}^{L-1} \sum_{j=0}^{L-1} [s(i,j,\delta,T)]^2$$

6.  Entropy

$$H(\delta,T) = -\sum_{i=0}^{L-1} \sum_{j=0}^{L-1} s(i,j,\delta,T) \log(s(i,j,\delta,T))$$

where

$$\mu_x = \sum_{i=0}^{L-1} \sum_{j=0}^{L-1} i \, s(i,j,\delta,T)$$

$$\mu_y = \sum_{i=0}^{L-1} \sum_{j=0}^{L-1} j \, s(i,j,\delta,T)$$

and where L is the number of gray levels in the processed image.
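
As an illustration of the quantities defined above, the following sketch
estimates a cooccurrence matrix over a rectangular region and computes the
energy, the entropy and the two means from it. It is not the study's own
implementation: the function names are hypothetical, the region T is
assumed to be the whole rectangular array, and pairs are counted in one
direction only (some formulations also count the reverse pair).

```python
import math

def cooccurrence(image, dx, dy, levels):
    """Estimate S(delta, T) for delta = (dx, dy) over a rectangular image.

    `image` is a list of rows of integer gray levels in 0..levels-1.
    """
    rows, cols = len(image), len(image[0])
    counts = [[0.0] * levels for _ in range(levels)]
    total = 0
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
                total += 1
    if total == 0:
        return counts
    # Normalize counts into estimated second-order probabilities.
    return [[n / total for n in row] for row in counts]

def energy(s):
    """Sum of squared matrix elements."""
    return sum(p * p for row in s for p in row)

def entropy(s):
    """Negative sum of p * log(p), skipping empty cells."""
    return -sum(p * math.log(p) for row in s for p in row if p > 0)

def means(s):
    """Return (mu_x, mu_y), the mean row and column gray levels."""
    mu_x = sum(i * p for i, row in enumerate(s) for p in row)
    mu_y = sum(j * p for row in s for j, p in enumerate(row))
    return mu_x, mu_y
```

For a two-level checkerboard and a horizontal displacement of one pixel,
all probability mass falls in the two off-diagonal cells, so the energy is
0.5 and the entropy is log 2.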

2.4  Comments on the Texture Measures

There are two places where a loss of important texture-context
information can occur. The first is in going from a digital image to the
cooccurrence matrices. Results reported in [2,29] together with the fact
that there is no known visually distinct texture pair which cannot be
discriminated by the cooccurrence matrices suggest that little texture
information loss occurs here. The other place a possible loss can occur
is in the transition from the matrices to the set of texture measures. It
was reported in [2] that the usual set of texture measures used with the
cooccurrence matrices, namely the energy, entropy, correlation, local
homogeneity and inertia measures, does not contain all the important
texture-context information. For example, Figure 3 shows a texture pair
which can easily be discriminated using information in the cooccurrence
matrices but which cannot be discriminated based on the values of the usual
five texture measures computed from these matrices.

Figure 3.  A visually distinct texture pair which can easily be
           discriminated based on information contained in the cooccurrence
           matrices but which cannot be discriminated based on the values
           of the energy, entropy, correlation, local homogeneity and
           inertia measures computed from these matrices.


Recent studies [30,31] have addressed this problem and have resulted in
the definition of two new measures and an explanation of the utility of
the inertia measure. The two new measures, the cluster shade and cluster
prominence, are believed to gauge the perceptual concepts of uniformity
and proximity [10,32]. Further, it was shown that the inertia measure can
be used to gauge the qualities of texture periodicity [33,34] and the
texture gradient [33]. Since the inertia, cluster shade and cluster
prominence are known not to be sufficient, in that a visually distinct
texture pair exists which cannot be discriminated by these measures, it was
decided to also include the energy, entropy and local homogeneity in the
measurement set used in this study.

To see how a texture measure computed from the cooccurrence matrices
can be used to determine a visual quality of a pattern, consider the
following example involving periodicity detection. For simplicity consider
a periodic texture composed of small black squares appearing on a white
background. Assume that the horizontal distance between the center of
one black square and the center of the square immediately to its right is
$\ell$. Further assume that this texture covers the whole plane. It can be
shown [31] that the inertia measure, $I(\delta,T)$, for large enough $T$
computed from this texture has the following properties:

i)   $I(\delta,T) = 0$ for $\delta = (\ell, 0^\circ)$,

ii)  $I(\delta,T) > 0$ for $\delta = (d, 0^\circ)$, $d = 1,2,\ldots,\ell-1$, and

iii) $I(\delta_n,T) = I(\delta_m,T)$ for $\delta_n = (n, 0^\circ)$,
     $\delta_m = (m\ell+n, 0^\circ)$ and $n,m = 1,2,\ldots$.

Consequently, to find $\ell$ one looks for the intersample spacing distance
which gives the minimum horizontal inertia value, and checks for
periodicity in $I(\delta,T)$. The number $\ell$ then gives the period of
the texture in direction $\theta$.
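
The search for the period can be sketched as follows. The definition of
the inertia measure does not survive in the text above, so the sketch
assumes the usual form, the sum over i and j of (i - j)^2 s(i,j,delta,T),
which equals the mean squared gray level difference between pixels
separated by delta. The function names are hypothetical, and only the
horizontal direction is handled.

```python
def inertia(image, d):
    """Horizontal inertia for delta = (d, 0 degrees).

    Assumed definition: mean of (g(x) - g(x + delta))^2 over all pixel
    pairs d columns apart, which is the sum of (i - j)^2 * s(i, j).
    """
    total = 0
    pairs = 0
    for row in image:
        for c in range(len(row) - d):
            total += (row[c] - row[c + d]) ** 2
            pairs += 1
    if pairs == 0:
        raise ValueError("displacement is larger than the region width")
    return total / pairs

def horizontal_period(image, max_d):
    """Smallest spacing in 1..max_d that gives the minimum inertia."""
    values = [inertia(image, d) for d in range(1, max_d + 1)]
    # list.index returns the first minimum, i.e. the smallest spacing.
    return 1 + values.index(min(values))
```

On a finite region the equalities in property iii) hold only approximately
because fewer pixel pairs are available at larger spacings, which is why
the properties are stated for large enough T.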

The importance of this periodicity detection is that it can be used
to identify a special type of unit pattern, the period parallelogram unit
pattern [31]. The utility of the period parallelogram unit pattern stems
from the fact that any periodic texture can be decomposed into a period
parallelogram unit pattern. Further, only two vectors, a and b shown in
Figure 4, specify not only the size and shape of the period parallelogram
unit pattern but the placement rules as well. Section 4.4 shows the
importance of the period parallelogram unit pattern in the segmentation of
urban scenes.


3.  FRAMEWORK  FOR  IMAGE  SEGMENTATION 

The proposed segmentation procedure is an early vision system.
Classically such procedures have been based upon detecting edges or
detecting uniformity by examining the histogram of the gray levels [36].
The cooccurrence matrices contain edge information as well as the
first-order probabilities of the gray levels. Hence measures computed from
cooccurrence matrices would appear to be useful early vision operators.

Further, it is desirable to have a segmentation procedure that would
allow one to move back and forth between segmentation levels, verifying
and reinforcing classifications without the need for semantic
information. It is believed that texture operators provide a means for
doing this. A texture pattern is made up of unit patterns and placement
rules [31]. One can consider a commercial block as a unit pattern. The
structures such as buildings and parking lots can be considered
micropatterns of this unit pattern. Cast in this framework, the problem
becomes one of analyzing macro and micropatterns of texture. An advantage
of this approach is that one can use uniform data structures and analysis
procedures in considering different levels of detail. This should provide
for a better structured segmentation method.

There are basically two approaches to image segmentation. These are
boundary detection and region formation [12,37,38]. Of these, region
formation approaches are best suited for use with texture operators since
texture operators, inherently, characterize the qualities of a region.
Region formation approaches can utilize either split, merge, or split and
merge techniques. A merge procedure is a bottom-up approach where small
regions are combined based upon some uniformity criterion. Unfortunately,
the statistical characterization of these small regions is less reliable
than that of larger ones. Further, in complicated scenes, such as urban
scenes, these small regions could belong to any one of a large number of
primitive classes such as grass, concrete, car, tree, etc. Combining
these primitives to form more meaningful groups, i.e., commercial area,
residential area, etc., would require a substantial reliance on semantic
information.

As  an  example,  one  might  decide  to  analyze  a  commercial  area  by  first 
detecting  edges  and  then  linking  them  together  to  form  structures  of  the 
scene.  One  could  then  perhaps  consult  a  model  to  determine  that  these 
structures  comprise  a  commercial  area.  A  problem  with  this  approach  is  that 
each of these subproblems is itself difficult and error prone.

A top-down procedure, such as a split procedure, seems to be best
suited for use with texture operators since classification accuracies
obtainable using texture analysis methods usually decrease as the region
size decreases. Therefore it is appropriate to use as large a region as
possible and divide it as necessary. Further, for an early vision system,
a split procedure seems most appropriate since it begins with a few broad
classes, i.e., commercial area, residential area, instead of a building,
street or tree. Because of the nature of the texture algorithm, as these
large regions are split the texture measures computed from the smaller
areas become more sensitive to finer detail. Consequently, classes
whose differentiation depends on finer detail can be handled later at
some higher level using a smaller region size and perhaps contextual
information. However, there are problems associated with initially
considering a large region size. For example, with large regions it is
likely that a mixture of patterns will be present. These problems are
addressed in Section 3.2.


The basic steps involved in the proposed segmentation process are
illustrated in Figure 5. At the first level of the segmentation the scene
is initially divided into $R_1$ regions such that these regions cover the
image. The size of the $R_1$ regions has been determined during the
training phase when the classes at level 1 were selected. Next the
appropriate texture measures are computed from the $R_1$ regions. These
measures are also determined during the training phase when the classes
are selected for level 1. Each of the $R_1$ regions is examined to
determine if it is uniform and should be labeled with one of the class
labels. If the region cannot be labeled, i.e., if it is a boundary region
or it contains one or more unspecified classes, then it is passed unlabeled
to the level 2 stage of processing where a new set of classes, $K_2$, is
considered. The process should stop when all the regions are labeled.
Figure 6 indicates the recursive nature of the process. Section 3.2
explains how the labeling decisions are made.
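
The recursive structure of the process can be sketched as below. This is
only an outline under stated assumptions: regions are taken to be
rectangles (row, column, height, width), the split is assumed to be into
four quadrants, and `classify` is a hypothetical callable that returns a
class label for a uniform region or None for a boundary or unspecified
region. None of these choices is specified in the text.

```python
def quad_split(region):
    """Split a rectangle (row, col, height, width) into four quadrants.

    Assumes the region is at least 2 x 2 pixels.
    """
    r, c, h, w = region
    h2, w2 = h // 2, w // 2
    return [
        (r, c, h2, w2),
        (r, c + w2, h2, w - w2),
        (r + h2, c, h - h2, w2),
        (r + h2, c + w2, h - h2, w - w2),
    ]

def segment(region, level, classify, split, max_level):
    """Return a list of (region, label) pairs for one starting region.

    classify(region, level) is hypothetical: it returns a class label when
    the region is judged uniform at this level, otherwise None.
    """
    label = classify(region, level)
    if label is not None or level >= max_level:
        # Either labeled, or left unlabeled at the deepest allowed level.
        return [(region, label)]
    labeled = []
    for sub in split(region):
        labeled.extend(segment(sub, level + 1, classify, split, max_level))
    return labeled
```

Passing the level to `classify` lets each level use its own class set,
region size and texture measures, as the description above requires.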

Several comments are in order. In this study only one level of
segmentation was performed where the classes were labeled and the splitting
criteria tested. However, the formulation of the segmentation procedure
will be presented in complete generality. Also, the segmentation process
will ultimately need to be extended to provide for a global optimization.
This extension can be readily added to the methods presented.

3.1  Motivating  the  Statistical  Segmentation  Strategy 

Region growing methods necessarily utilize clustering techniques
[36,37,38,39]. Typically each subregion of the scene is characterized
by a measurement vector $x = [x_1, x_2, \ldots, x_n]$ where $x_i$ denotes
the value of measurement $i$. Such a vector is a point in n-dimensional
Euclidean space, $E^n$. Intuitively, measurement vectors computed from
visually similar regions should lie "close" together in $E^n$ while
measurement vectors computed from visually dissimilar regions should lie
"farther" apart. Consequently, measurement vectors computed from regions
containing the same class should form a cluster in $E^n$.

Figure 6.  The natural pyramid structure of the segmentation. (a) An
           example of a three level segmentation where level 0 corresponds
           to the whole image, level 1 the first level of the segmentation,
           etc. (b) The regions formed during the segmentation. (c) A tree
           showing how the regions of (b) were formed.

The proposed segmentation procedure attempts to incorporate the useful
attributes of both supervised and unsupervised clustering approaches
[39,40]. It utilizes some knowledge of the scene by allowing one to
select the classes to be considered at each level of processing. This
is accomplished by selecting a training set for each class at each level
where it is to be considered. Thus measurement selection can be performed
so that only the best measures need be used in doing the segmentation,
a mode of operation allowed only by supervised procedures. The proposed
procedure provides the flexibility to determine whether a region is
composed in part or entirely of a class different from the K preselected
classes it has been taught to recognize, a capability usually only found
in unsupervised procedures. This capability to detect such "unspecified"
regions is an important part of the segmentation process. The
procedure also provides a mechanism for identifying regions composed of
two or more of the K preselected classes, i.e., "boundary" regions.
These capabilities enable it to split such unspecified and boundary
regions and to examine the resulting smaller regions using different
classes and a different level of detail.

The proposed segmentation approach would seem somewhat similar to that
presented by Chen and Pavlidis [41,42]. In this case, however, a more
sophisticated multivariate formulation for the decision making is developed.

3.2  Formalizing the Concepts

To formalize the above concepts a number of definitions are useful.

Definition 3. At a particular level of the segmentation process a
region R is said to be uniform if it is composed entirely of only one
of the K classes the procedure has been trained to recognize at that
level.

Definition 4. At a particular level a region R is said to be a boundary
region if it is composed of two or more of the K classes the procedure
has been trained to recognize at that level.

Definition 5. At a particular level a region R is said to be an
unspecified region if any part of R contains a class unknown to the
procedure at that level, i.e., something other than one of the K classes.

The segmentation procedure requires that given a region R, the
following decisions be made.

i)   Determine if R is either an unspecified region or a
     boundary region. If it is either then split R to
     form higher level regions.

ii)  Determine if R is a uniform region. If so, label R with
     one of the K possible class labels.

The capability to call R an unspecified region provides the procedure
the ability to recognize an "unusual looking" region and not to force
it into one of K known classes.

The required decisions can be stated as a series of hypothesis tests.
For simplicity consider only two classes, $\omega_1$ and $\omega_2$. Given
a region R, one can define the following hypotheses.

$H_0$: R is composed entirely of class $\omega_1$.

$H_1$: R is composed entirely of class $\omega_2$.

$H_2$: R is a mixture of both $\omega_1$ and $\omega_2$.

$H_3$: R is composed of something other than $\omega_1$ and/or $\omega_2$.

Note that for hypothesis $H_3$, if any part of R contains something other
than $\omega_1$ and/or $\omega_2$ then R is considered as not containing
either $\omega_1$ or $\omega_2$.

Determination of the uniformity of a region involves two tests. R is
uniform if either hypothesis $H_0$ or $H_1$ is accepted in Tests 1 and 2.
If R is uniform then Test 3 resolves the labeling.

TEST 1: $H_3$ versus $H_0$ or $H_1$, to differentiate uniform regions
        from unspecified regions.

TEST 2: $H_2$ versus $H_0$ or $H_1$, to differentiate uniform regions
        from boundary regions.

TEST 3: $H_0$ versus $H_1$, to assign a class label to a uniform
        region.

Methods have been developed for performing these tests. In developing
these methods an assumption was made concerning the nature of the
probability density functions. The assumption not only aids in developing
the tests but also allows a parametric approach to be taken.

Assumption 1. Let $\omega_j$ be one of a possible K classes for a level.
It is assumed that $f(x|\omega_j)$ is normal, $N(\mu_j, \Sigma_j)$, with
mean $\mu_j$ and covariance $\Sigma_j$.
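
To make the role of the normality assumption concrete, the following
sketch shows one simple way a normal model can support an "unspecified"
decision and a labeling decision. It is not the test formulation of this
paper, whose statistics are not reproduced in this section. The sketch
assumes, purely for brevity, diagonal covariance matrices, a
squared-distance threshold chosen by the user, and hypothetical function
names; the boundary test is not covered.

```python
import math

def mahalanobis_sq(x, mean, var):
    """Squared Mahalanobis distance for a diagonal covariance (assumption)."""
    return sum((xi - mi) ** 2 / vi for xi, mi, vi in zip(x, mean, var))

def log_density(x, mean, var):
    """Log of the normal density N(mean, diag(var)) evaluated at x."""
    n = len(x)
    log_det = sum(math.log(v) for v in var)
    return -0.5 * (mahalanobis_sq(x, mean, var) + log_det
                   + n * math.log(2 * math.pi))

def label_or_reject(x, classes, threshold):
    """Return the most likely class name, or None if x fits no class.

    classes maps a class name to (mean, var). A class is a candidate only
    if x lies within `threshold` squared Mahalanobis distance of its mean;
    with no candidates the region is treated as unspecified.
    """
    candidates = {name: params for name, params in classes.items()
                  if mahalanobis_sq(x, *params) <= threshold}
    if not candidates:
        return None
    return max(candidates, key=lambda name: log_density(x, *candidates[name]))
```

Under the normal model the squared distance follows a chi-square
distribution with as many degrees of freedom as there are measurements, so
for two measurements a threshold of about 5.99 keeps roughly 95% of the
samples of a class.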


3.3  The Multiclass Formulations of the Hypotheses Tests

The formulations for hypothesis Tests 1, 2 and 3 given above can be
easily generalized to cases when more than two classes are involved. This
straightforward generalization requires one to consider density functions
$f(x|\omega_j)$, $j = 1,\ldots,K$, all of which exist in the same pattern
space. However, rather than using this straightforward formulation a
modification will be used, one which affects the way each test will be
conducted.


The motivation for the modification is based on the fact that the
number of training samples available limits the number of measurements that
can be used. This fact stems from the Hughes peaking phenomenon [43];
Foley [44] suggests that there be at least 10 training samples available
for each class for each measurement used. This restriction poses a problem
in situations where there is a class with significantly fewer training
samples than the other classes. In typical multiclass procedures the
number of training samples available for such a class determines the
maximum number of measurements which can be used to define all the classes.

Usually in an image analysis problem the quality of the measurements
is such that a measure is useful in discriminating only a few of the
classes. Consequently, a number of measures are required. Therefore
a restriction on the number of measurements imposed by a single
class would seemingly adversely affect overall performance. A better
approach would be to subdivide the problem into a number of independent
decisions where the effect of the restriction on the number of measurements
will be minimized. A pairwise approach to each test accomplishes this
objective.

To describe the pairwise methods used to implement the hypothesis
tests the following notation will be used. Let $f_{jk}(x_{jk}|\omega_j)$ be the
class conditional density function of class $\omega_j$ for the class pair
decision involving $\omega_j$ and $\omega_k$. Assume $f_{jk}(x_{jk}|\omega_j)$ is
normal, $N(\mu_{j,k}, \Sigma_{j,k})$. To aid in understanding the notation, the
subscript jk on f and x is used to indicate the density functions and
measurement vector involved in the class pair $\omega_j$, $\omega_k$ decision.
The subscript j,k on $\mu$ and $\Sigma$ is an ordered pair where j indicates
the class $\omega_j$ of which $\mu$ and $\Sigma$ are the mean vector and
covariance matrix and k denotes the other class, $\omega_k$, involved in the
class pair decision.




To determine the measurement subset which defines the components of
$x_{jk}$ for each class pair a measurement selection procedure is used. The
algorithm used in the study is a forward sequential search (FSS) algorithm
which independently selects a subset of the available measurements to use
in making each class pair decision. At the mth iteration the algorithm
augments the subset selected at the (m-1)th iteration with another measure.
This measure is the one which, when combined with the subset selected at the
(m-1)th iteration, gives the best probability of correct classification.
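
The greedy step described above can be sketched in a few lines. This is a minimal illustration, not the implementation used in the study: the function name `forward_sequential_search`, the `score` callback (standing in for the estimated probability of correct classification of a class pair), and the toy criterion are all assumptions introduced here.

```python
from typing import Callable, List, Sequence


def forward_sequential_search(
    candidates: Sequence[int],
    score: Callable[[List[int]], float],
    max_size: int,
) -> List[int]:
    """Greedy forward selection: grow the subset one measure at a time."""
    selected: List[int] = []
    remaining = list(candidates)
    for _ in range(min(max_size, len(remaining))):
        # Try every unused measure together with the subset chosen so far
        # and keep the one giving the highest criterion value.
        best = max(remaining, key=lambda m: score(selected + [m]))
        selected.append(best)
        remaining.remove(best)
    return selected


def toy_score(subset: List[int]) -> float:
    """Hypothetical criterion: additive usefulness with one redundant pair."""
    usefulness = {0: 0.60, 1: 0.20, 2: 0.55, 3: 0.30}
    total = sum(usefulness[m] for m in subset)
    if 0 in subset and 2 in subset:
        total -= 0.40  # measures 0 and 2 carry largely the same information
    return total


chosen = forward_sequential_search(range(4), toy_score, max_size=2)
```

In the pairwise setting the search would be run once per class pair, so each pair can end up with a different measurement subset.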

3.3.1 TEST 1: Differentiating Uniform from Unspecified Regions

A region R is unspecified if either R is composed entirely of a
new class, i.e., not one of the K possible; or it is composed of a
combination of two or more new classes; or it is composed of a combination
of one or more new classes with one or more of the K classes. In any
event the distribution of x computed from unspecified regions should be
different from the distributions defining the K classes. Consequently,
this hypothesis test becomes merely a matter of determining whether x
is a member of any of the populations of the K classes.

A standard mechanism for performing such a test is to use the
Chi-squared test. In a pairwise form of this test, R is considered to
be a uniform region if for at least one class $\omega_j$ the statistic
$\tau^2$ computed from $x_{j\ell}$ satisfies

$\tau^2 < \chi^2_{\alpha, d_{j\ell}}$

for $\ell = 1,\ldots,K$, $\ell \neq j$. Otherwise consider R an unspecified
region. Here $\chi^2_{\alpha, d_{j\ell}}$ is the $100\alpha$ percentage point of
the Chi-squared distribution with $d_{j\ell}$ degrees of freedom,
$\alpha = \mathrm{Prob}(\tau^2 < \chi^2_{\alpha, d_{j\ell}})$, and $d_{j\ell}$ is the
dimensionality of $x_{j\ell}$ [45]. Reference [46] gives an application of such
testing to texture analysis.

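It should be noted that since the dimensionality of $x_{j\ell}$ may vary
from class pair to class pair, one needs a method for relating $\tau^2$,
$\alpha$ and $\chi^2_{\alpha, d_{j\ell}}$. The interrelationship is given by

$\alpha = 1 - \exp(-\tfrac{1}{2}\tau^2) \sum_{i=1}^{d_{j\ell}/2} \frac{\tau^{2(i-1)}}{2^{i-1}(i-1)!}$

for $d_{j\ell}$ even, and

$\alpha = \int_V f(t)\,dV = \sqrt{\frac{2}{\pi}} \left[ \int_0^{\tau} \exp(-\tfrac{1}{2} r^2)\,dr - \exp(-\tfrac{1}{2}\tau^2) \sum_{i=1}^{(d_{j\ell}-1)/2} \frac{i!\,2^i\,\tau^{2i-1}}{(2i)!} \right]$

for $d_{j\ell}$ odd, where

$f(t) = (2\pi)^{-d_{j\ell}/2} \exp(-\tfrac{1}{2} t^T t)$

and dV is the differential of volume.

Note that in the above $\chi^2_{\alpha, d_{j\ell}} = \tau^2$.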
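
The two closed forms relating $\tau^2$ and $\alpha$ can be checked numerically. The sketch below is an illustration under the assumption that the even case is a finite sum times $\exp(-\tau^2/2)$ and the odd case is a normal integral minus a finite correction sum; the function name `alpha_from_tau2` is introduced here and does not come from the study. The integral from 0 to $\tau$ is evaluated with the error function from the standard library.

```python
import math


def alpha_from_tau2(tau2: float, d: int) -> float:
    """Prob(chi-squared variable with d degrees of freedom < tau2)."""
    tau = math.sqrt(tau2)
    decay = math.exp(-0.5 * tau2)
    if d % 2 == 0:
        # d even: sum of tau^(2(i-1)) / (2^(i-1) (i-1)!) for i = 1..d/2
        series = sum(
            tau2 ** (i - 1) / (2 ** (i - 1) * math.factorial(i - 1))
            for i in range(1, d // 2 + 1)
        )
        return 1.0 - decay * series
    # d odd: integral of exp(-r^2/2) over [0, tau], written with erf
    integral = math.sqrt(math.pi / 2.0) * math.erf(tau / math.sqrt(2.0))
    series = sum(
        math.factorial(i) * 2 ** i * tau ** (2 * i - 1) / math.factorial(2 * i)
        for i in range(1, (d - 1) // 2 + 1)
    )
    return math.sqrt(2.0 / math.pi) * (integral - decay * series)
```

Because the function works for any dimensionality, a single $\alpha$ can be fixed for all class pairs and the matching threshold found per pair, for example by bisection on `tau2`.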
3.3.2 TEST 2: Differentiating Uniform from Boundary Regions

A boundary region R can be composed of combinations of 2 classes,
3 classes, ..., K classes. It can be argued that as the number of classes
contained in a boundary region increases the more dissimilar the measures
computed from this region will be from any of the K known classes.
Consequently the greater the probability that such boundary regions will be
detected as unspecified regions by Test 1. The more difficult problem
is in differentiating uniform regions from boundary regions containing
only two classes. This differentiation is particularly important since
as the region size gets smaller with increasing levels the basic test
differentiating uniform regions from boundary regions is that involving
boundary regions composed of only two classes. Therefore, the objective
of Test 2 is differentiating uniform regions from such boundary regions.
To develop this test let $b_{jk}$ denote a boundary region R composed of
$\omega_j$ and $\omega_k$. Let x be a measurement vector computed from R. Assume
that $x = \beta x_1 + (1-\beta) x_2$ where $x_1$ is the measurement vector
computed from the $100\beta$ percent of R which contains only class $\omega_j$
and $x_2$ is the measurement vector computed from the $100(1-\beta)$ percent
of R which contains only class $\omega_k$. Further assume that for $b_{jk}$
boundary regions $\beta$ is uniformly distributed, $0 < \beta < 1$.

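Under the above assumption it can be shown that

$f_{jk}(x_{jk}|b_{jk}) = \int_0^1 f_{jk}(x_{jk}|\beta)\,d\beta$

where $f_{jk}(x_{jk}|b_{jk})$ is the density function of $x_{jk}$ computed from
$b_{jk}$ boundary regions and further where

$f_{jk}(x_{jk}|\beta) = N(\mu_{j,k}(\beta), \Sigma_{j,k}(\beta))$,

$\mu_{j,k}(\beta) = \beta \mu_{j,k} + (1-\beta) \mu_{k,j}$

and

$\Sigma_{j,k}(\beta) = \beta^2 \Sigma_{j,k} + (1-\beta)^2 \Sigma_{k,j}$.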

Given the above the obvious pairwise test is to call R uniform if there
exists at least one k such that

$f_{jk}(x_{jk}|\omega_k)\,P(\omega_k) > f_{jk}(x_{jk}|b_{jk})\,P(b_{jk})$

for $j = 1,\ldots,K$, $j \neq k$, where $P(b_{jk})$ denotes the a priori
probability of occurrence of boundary regions composed of $\omega_j$ and
$\omega_k$. Otherwise call R a boundary region.
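
To make the cost of this test concrete, the boundary density can be approximated by numerical integration over $\beta$. The sketch below is illustrative only: it assumes the mixed mean is $\beta\mu_{j,k} + (1-\beta)\mu_{k,j}$ and the mixed covariance is $\beta^2\Sigma_{j,k} + (1-\beta)^2\Sigma_{k,j}$, uses a simple midpoint rule, and the names `normal_pdf` and `boundary_density` are introduced here.

```python
import numpy as np


def normal_pdf(x, mean, cov):
    """Multivariate normal density N(mean, cov) evaluated at x."""
    x = np.atleast_1d(x).astype(float)
    mean = np.atleast_1d(mean).astype(float)
    cov = np.atleast_2d(cov).astype(float)
    diff = x - mean
    quad = diff @ np.linalg.solve(cov, diff)
    norm = np.sqrt((2.0 * np.pi) ** len(x) * np.linalg.det(cov))
    return float(np.exp(-0.5 * quad) / norm)


def boundary_density(x, mean_j, cov_j, mean_k, cov_k, steps=200):
    """Midpoint-rule estimate of the integral of f(x | beta) over (0, 1)."""
    mean_j = np.atleast_1d(mean_j).astype(float)
    mean_k = np.atleast_1d(mean_k).astype(float)
    cov_j = np.atleast_2d(cov_j).astype(float)
    cov_k = np.atleast_2d(cov_k).astype(float)
    total = 0.0
    for beta in (np.arange(steps) + 0.5) / steps:
        mean_b = beta * mean_j + (1.0 - beta) * mean_k
        cov_b = beta ** 2 * cov_j + (1.0 - beta) ** 2 * cov_k
        total += normal_pdf(x, mean_b, cov_b)
    return total / steps
```

One such integration is needed per class pair and per measurement vector, which is the computational burden the text refers to.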

The development of the above test requires the estimation of the
a priori probabilities, $P(b_{jk})$. These are difficult to estimate and their
values can affect the test results. In addition, this test is
computationally complex since it requires $\binom{K}{2}$ numerical integrations
to evaluate the density functions $f_{jk}(x_{jk}|b_{jk})$.

Because this test is computationally complex and, moreover, yields
only approximate decision surfaces, it was decided to use this test only to
provide intuition into developing a simpler test; one which does not
require the evaluation of the density functions $f_{jk}(x_{jk}|b_{jk})$. The
simpler test is to call R uniform if there exists at least one k such that

$\frac{f_{jk}(x_{jk}|\omega_j)}{f_{jk}(x_{jk}|\omega_k)} < r$

for $j = 1,\ldots,K$, $j \neq k$ and r, a preselected number, $0 < r < 1$.

Otherwise call R a boundary region. This test is referred to as
the "ratioing" test.
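
A minimal sketch of a ratioing test follows. It rests on assumptions that should be read as such: the ratio is taken to compare the density of each competing class with that of the candidate class, every class pair shares one scalar measurement (the study selects a separate subset per pair), and the names `normal_pdf_1d` and `ratio_test` are introduced here for illustration.

```python
import math


def normal_pdf_1d(x, mean, var):
    """Univariate normal density."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def ratio_test(x, classes, r):
    """Return the index k of a class passing the ratio test, or None.

    classes is a list of (mean, variance) pairs; r is the preselected
    threshold with 0 < r < 1.
    """
    for k, (mean_k, var_k) in enumerate(classes):
        f_k = normal_pdf_1d(x, mean_k, var_k)
        if f_k > 0.0 and all(
            normal_pdf_1d(x, m, v) / f_k < r
            for j, (m, v) in enumerate(classes)
            if j != k
        ):
            return k  # region looks uniform, dominated by class k
    return None  # no class dominates: treat the region as a boundary region
```

A measurement lying midway between two class means gives a ratio near 1 for both orderings, so the region is flagged as a boundary; smaller values of r make the test stricter.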

3.3.3  TEST  3:  Labeling  Uniform  Regions 

A pairwise Bayesian classification scheme will be used to label the
uniform regions. Such a procedure is an optimal strategy for minimizing
the average misclassification error for each pairwise problem [47]. The
pairwise method involves solving K(K - 1)/2 separate subproblems, each
involving a class pair, instead of one K class problem directly. In the
procedure used in this study the class conditional density functions
used in making each class pair decision are assumed to be
normal. Further the a priori probabilities of the classes are assumed
to be equal. The results of each class pair decision are tallied using a
polling function. The measurement vector is assigned to the class which
rejected the vector the fewest number of times.
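
The polling step can be sketched as follows. This is a simplified illustration, assuming one scalar measurement shared by all pairs and univariate normal densities; the study uses a separately selected measurement vector per class pair. The names `log_normal_1d` and `pairwise_label` and the example class parameters are hypothetical.

```python
import math
from itertools import combinations


def log_normal_1d(x, mean, var):
    """Log of the univariate normal density."""
    return -0.5 * math.log(2.0 * math.pi * var) - 0.5 * (x - mean) ** 2 / var


def pairwise_label(x, classes):
    """classes maps a class name to (mean, variance).

    Each of the K(K-1)/2 pair decisions rejects one class; the class
    rejected the fewest times is returned.
    """
    rejections = {name: 0 for name in classes}
    for a, b in combinations(classes, 2):
        log_a = log_normal_1d(x, *classes[a])
        log_b = log_normal_1d(x, *classes[b])
        # Equal a priori probabilities: the Bayes rule reduces to
        # comparing the two class conditional densities.
        loser = b if log_a >= log_b else a
        rejections[loser] += 1
    return min(rejections, key=rejections.get)


example_classes = {"A": (0.0, 1.0), "B": (5.0, 1.0), "C": (10.0, 4.0)}
```

With K classes a class can be rejected at most K - 1 times, so a class that wins every one of its pair decisions has zero rejections.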

For a more complete description of these tests the reader is
referred to [48].


4. SELECTION OF SCENE DEPENDENT PARAMETERS


In any segmentation method there are certain scene dependent
parameters which must be input into the algorithm. For each level to be
considered in the proposed segmentation algorithm the following must be
specified:

a) the K classes to be considered,

b) the size of the regions, R, to be used,

c) the number of displacement vectors to be used in computing
the cooccurrence matrices and

d)  a  method  for  selecting  the  training  samples  from  the  image. 

In this study only one level was considered and hence only one set
of parameters had to be selected. However, similar reasoning to that
presented should be applicable to any level. The level selected for this
study should be considered as a low level in the pyramid data structure,
see Figure 6, because of the region size used and also because of the amount
of detail present in the classes.

4.1 Selection of the Classes for the Segmentation

Three  criteria  were  established  for  selecting  the  classes. 

1)  The  classes  should  correspond  as  directly  as  possible  to  the 
M,  C  &  G  tangible  features. 

2)  Each  class  should  comprise  a  large  enough  region  of  the  image  that 
an  adequate  training  set  can  be  selected. 

3) The classes were chosen such that if $R_j$ represents the regions of
the image labeled class $\omega_j$, then

$\bigcup_{j=1}^{K} R_j$

should be the whole scene. Further,
$R_j \cap R_k = \emptyset$, $j \neq k$.

Criteria 1, 2 and 3 led to the selection of the classes in Table 1.

An  important  point  that  should  be  made  concerning  the  runway,  vehicle 
parking,  aircraft  parking  and  multilane  highway  classes  is  that  all  these 
classes  are  composed  of  the  same  micropatterns,  namely,  a  large  paved 
area  surrounded  by  dry  land.  This  might  lead  one  to  think  that  it  would 
be  difficult  for  texture  analysis  methods  to  discriminate  them.  However, 
one  should  note  that  there  are  clues  that  can  be  used  by  the  textural 
analyzer.  A  vehicle  parking  area  can  be  discriminated  from  the  other 
classes  because  of  the  presence  of  cars  parked  in  a  regular  fashion.  If 
no  cars  are  present  the  stripes  used  to  delineate  the  parking  places  can  be 
used  to  make  the  determination.  However,  if  neither  of  these  signs  are 
present  and  only  a  concrete  slab  is  visible  then  the  discrimination  would 
not  be  possible  based  solely  on  textural  information.  Similarly  with  the 
aircraft  parking  area,  if  planes  are  not  present  and  parked  in  a  regular 
fashion  then  textural  information  cannot  be  UBed  by  itself  to  discriminate 
this  class.  Runways  and  multilane  highways  can  be  discriminated  because  of 
the  difference  in  the  width  of  the  paved  area  comprising  them.  Highways, 
even  those  with  several  lanes,  are  not  usually  as  wide  as  runways  or 
taxlways  at  airports.  Also,  the  natural  wear  marks,  i.e.,  tire  marks, 
oil  marks  etc. ,  are  different.  All  these  indicators  can  be  detected  by 

S 

K 

texture  algorithms. 

The  dry  land  class  represents  a  combination  of  five  reasonably  diverse 
M, C & G tangible features. Many of these M, C & G features made up such
a small area of the image that they could not be considered as separate
classes.  For  example,  there  were  only  two  small  mineral  piles  and  while 
there  were  19  sited  occurrences  of  deciduous  woodland,  the  total  area  occupied 
was  very  small.  Similarly,  the  total  area  of  crop  (cultivated)  land  in  the 
scene  was  very  small  and  seemed  inadequate  for  obtaining  a  satisfactory 
training  set.  Finally,  while  there  was  a  reasonably  significant  amount  of 
levee, a decision was made not to consider levee/embankment/fill as a
separate  class.  Given  that  the  mineral  pile,  deciduous  woodland,  crop 
(cultivated)  land  and  levee/embankment/fill  features  would  not  be  treated 
as  separate  classes,  it  remained  to  determine  with  which  class  each  should 
be  merged.  The  most  obvious  selection  seemed  dry  land. 

The most diverse class is the commercial/industrial class. It is
composed  of  a  total  of  twenty  different  M,  C  &  G  features.  Each  feature 
comprising  this  class  did  not  seem  to  occupy  enough  area  of  the  image  to 
be  considered  a  separate  class.  These  twenty  features  were  all  combined 
into  one  class  because  no  smaller  groupings  of  them  could  be  found  which 
resulted  in  more  homogeneous  visual  classes  and  which,  at  the  same  time, 
provided  an  adequate  set  of  training  samples. 

Ten of the 47 M, C & G features appearing in the image defy
categorization in the nine class scheme chosen. Most of these features
represent "small" objects which usually occur as stand alone entities.
Examples are the mine shaft structure, display sign, radio/TV antenna and
power transmission line features. Because these features occurred so
infrequently and comprised such a small area of the image, they were not
considered in the segmentation.

4.2 Selection of the Region Size

For each level of segmentation there is a strong interrelationship
between the size of the region R and the classes which can be
considered. For each class $\omega_j$ the training samples for this class must
be entirely composed of $\omega_j$. The larger R the more difficult it is to find
such training samples. Consequently the larger R the fewer the number, K,
of classes that can be considered since each class must occupy a larger area
of the scene.

The commercial/industrial class effectively established a lower bound
on the region size that could be used since the "unit pattern" of the
commercial/industrial class is substantially larger than the unit patterns
of the other classes considered. For example, see Figures 7 and 8.

The region size selected was 145 x 145. It was a compromise between
two opposing requirements. First, one would like the size of R to be as
small as possible so that a fine segmentation can be obtained, i.e., so that
the boundaries between classes can be accurately determined. Unfortunately,
as should be clear from Figures 7 and 8, the smaller the region size the
greater the probability of misclassification. The second requirement, to
have each region be labeled as accurately as possible, forces larger region
sizes to be used. The size chosen was estimated by overlaying various region
sizes on the image and selecting the size which appeared to be larger than
the largest of the unit patterns for the various classes considered.

4.3 Selection of the Training Samples

Ground  truth  information  was  provided  which  subdivided  the  scene 
into  areas  corresponding  to  the  M,  C  &  G  features.  These  data  were  translated 
into  the  nine  class  form  using  the  correspondences  provided  in  Table  1. 

Figure 9 shows examples of the training samples selected. Note each
training sample is 145 x 145.

commercial/industrial area.

To "effectively" increase the total number of training samples,
overlapping was permitted, i.e., $S_n^j \cap S_m^j$ was not required to be
$\emptyset$, where $S_n^j$ is sample n of class $\omega_j$ and $S_m^j$ is sample m
of class $\omega_j$. If $S_n^j \cap S_m^j \neq \emptyset$ then the area of the
intersection was always less than half the area of $S_n^j$ or $S_m^j$, i.e.,
145 x 145. An example of the overlapping is shown in Figure 10.

The fact that the training sample regions overlap might seem, at first
glance, a bit disquieting. However, overlapping is desirable because one
cannot always be assured how the grid cells will be located in the testing
phase of a study. Obviously in the cases where sufficient data is
available for training, no overlapping is required.

4.4 Selection of the Displacement Vectors

Recall the displacement vector $\delta$ is a parameter in the algorithm.
For each value of $\delta$ a cooccurrence matrix, $S(\delta, T)$, is computed. Each
displacement vector, $\delta = (d, \theta)$, has two components d and $\theta$ where d
is the intersample spacing distance and $\theta$ is the directional orientation.

The $\theta$ parameter provides directional sensitivity and investigators have
usually assumed that only four values of $\theta$ were needed, $\theta$ = 0°, 45°,
90°, 135° [1,2,3,4,5,7,8,22]. Also, previous work [6,7] usually involved at
most two values of d, namely d = 1, 2, for a total of eight different values
of $\delta$.
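
For readers who want to experiment, a cooccurrence matrix for one displacement $(d, \theta)$ can be sketched as below. Several choices here are assumptions rather than details taken from the study: the offset is rounded to the nearest pixel, the matrix is made symmetric and normalized to sum to one, and the inertia and local homogeneity measures use the commonly quoted definitions $\sum (i-j)^2 s(i,j)$ and $\sum s(i,j)/(1+(i-j)^2)$. The function names are introduced for illustration.

```python
import numpy as np


def cooccurrence(img, d, theta_deg, levels):
    """Symmetric, normalized gray level cooccurrence matrix for (d, theta)."""
    theta = np.deg2rad(theta_deg)
    dx = int(round(d * np.cos(theta)))
    dy = -int(round(d * np.sin(theta)))  # image rows increase downward
    h, w = img.shape
    # Region of pixels whose displaced partner is still inside the image.
    r0, r1 = max(0, -dy), min(h, h - dy)
    c0, c1 = max(0, -dx), min(w, w - dx)
    first = img[r0:r1, c0:c1]
    second = img[r0 + dy:r1 + dy, c0 + dx:c1 + dx]
    s = np.zeros((levels, levels))
    np.add.at(s, (first.ravel(), second.ravel()), 1)
    s = s + s.T  # count each pair in both directions
    return s / s.sum()


def inertia(s):
    i, j = np.indices(s.shape)
    return float(((i - j) ** 2 * s).sum())


def local_homogeneity(s):
    i, j = np.indices(s.shape)
    return float((s / (1.0 + (i - j) ** 2)).sum())


# Vertical stripes with a period of two pixels: columns alternate 0, 1.
stripes = np.tile(np.array([0, 1]), (8, 4))
```

For the striped test image the inertia is largest when d is an odd number of pixels along the 0° direction and drops to zero when d matches the stripe period, which is the kind of periodic behavior exploited later when d values are chosen from plots.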

Recent work [1,2,9,30,31], however, has indicated that the discriminatory
power can be improved by considering more values of $\delta$. However, from a
practical point of view the number of values must be limited. The problem
then is to select a relatively few $\delta$ values which will allow good
discrimination. Since there is little theory on the subject of selecting the
$\delta$ values, a heuristic method was employed.

First the values of $\theta$ were selected. This was done by determining
the orientations of the majority of the streets appearing in the scene.
It should be observed that most manmade urban structures tend to be
aligned with the streets. Thus, the orientations of the streets are
important. Note that the lower half and the upper right part of
Figure 1 have streets which are aligned in the 75° and 165° directions.
The upper left part of the image has streets aligned in the 0° and 90°
directions. Finally, the airport has runways aligned in the 19° and
109° directions. Hence, the $\theta$'s considered were 0°, 19°, 75°, 90°,
109°, and 165°. Obviously using such scene specific information as the
orientation of the streets tends to limit the generality of the
segmentation results.

The other components of the period parallelogram in the texture model
besides the directions are the magnitudes. In order to determine if urban
land use data fit this texture model, representative areas from each of the
nine classes were examined. Figure 11 shows a representative area selected
for the residential class. From each of the representative areas the
texture measures were computed for d = 1,...,80 and for $\theta$ values which
reflected the major structural character of the pattern. Usually only
two $\theta$ values were considered for each of the representative regions and
these $\theta$ values were usually 90° apart. This reflects the geometry of
manmade structures.

Plots were generated showing the variation in the measurement
values as a function of d for a given $\theta$. Examples of two such plots for
the inertia measure are shown in Figure 12. A comparison of these plots
for the nine representative regions allows determination of those d
values which provide good discrimination. Consider, for example, the
plots in Figure 12(a) and 12(b). Observe that there is a significant
difference in the inertia measure values for d from 16 through 22.
Consequently, d values of 16 and 20 should provide good discrimination.
Note that d values of 17, 18 and 19 would also provide good discrimination
between these two classes. Similar consideration of all the plots
resulted in the selection of eight values of d, d = 1, 2, 4, 6, 8, 12, 16, 20.
Hence the total number of $\delta$ values considered initially was 48, eight d
values and six $\theta$ values.

Figure 11. (a) Residential area and (b) mobile home area used to compute
the nominal values of the texture measurements. Note these regions were
not required to be 145 x 145 but just homogeneous areas.
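
The count of 48 follows directly from pairing every distance with every orientation. A short sketch, with the variable names and the pixel-offset rounding introduced here as assumptions:

```python
import math
from itertools import product

D_VALUES = [1, 2, 4, 6, 8, 12, 16, 20]       # intersample spacings
THETA_VALUES_DEG = [0, 19, 75, 90, 109, 165]  # street and runway orientations

# Every (d, theta) combination is one displacement vector.
DISPLACEMENTS = list(product(D_VALUES, THETA_VALUES_DEG))


def pixel_offset(d, theta_deg):
    """Nearest-pixel (dx, dy) offset for a displacement; rounding is assumed."""
    theta = math.radians(theta_deg)
    return round(d * math.cos(theta)), round(d * math.sin(theta))


OFFSETS = [pixel_offset(d, t) for d, t in DISPLACEMENTS]
```

Rounding matters for small d: at d = 1 several of the six orientations collapse onto the same pixel offset, so the number of distinct offsets can be smaller than 48.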

There are some interesting observations which can be made about the
plots in Figure 12. First, note that the local minimum occurring at d = 74
in Figure 12(a) corresponds to the distance between the streets in
Figure 11(a). Further the local minimum occurring at d = 38 corresponds
to the distance from the center of a street to the back property line of
a residential lot. Plots of the inertia measure in the 75° direction
show that the inertia measure can also be used to gauge the width of the
lots. Hence the inertia measure can determine the average lot size in
the subdivision pictured, i.e., the period parallelogram unit pattern
of the residential area. Such information could theoretically be used to
determine the area which has to be searched for objects such as cars,
driveways, etc. Similarly plots in Figure 12(b) show that the inertia
measure can be used to gauge structural information from the mobile home
area of Figure 11(b).

The last example which shows how structural information can be
gauged is the vehicle parking area shown in Figure 13. Figure 14 shows
the basic structure of this area and gives the number of pixels between
the various elements of the scene. Figure 15 shows plots of both the local
homogeneity measure and the inertia measure computed along the $\theta$ = 75°
direction. Note that the local homogeneity measure can be used to determine
the distance between the rows in the parking lot while the inertia measure
cannot. Further it is interesting to note that the power spectrum cannot
be used to detect this structure either since the power spectrum and inertia
measure are essentially equivalent [2,31].

[Figure 14 caption, partially recovered: "... vehicle parking area. Also
given is ... between elements of the scene."]

Given the fact that the texture measures gauge visually perceivable
qualities of patterns, the process described above for selecting the $\delta$
values allows one to incorporate the visual differences among the classes
into the segmentation process.


5.  A  ONE  LEVEL  SEGMENTATION  OF  THE  SCENE 


In this section studies are described which

i) evaluate the capabilities of the texture measures
to characterize the land use classes;

ii) evaluate the capabilities of the pairwise classification
procedure to label the uniform regions, i.e., ones which
contain only one land use class; and

iii) estimate the performance of the proposed
segmentation procedure.

These studies resulted in a one level segmentation of the scene. Note
that the capabilities of the Chi-squared and ratioing tests used in
differentiating uniform regions from those which should be split are given
in Section 6.

5.1 Capabilities of the Texture Measures to Characterize Land Use Classes

The best method for evaluating the capabilities of the texture measures
is to determine how well they can discriminate the land use classes.
Commonly the training results are used to estimate this discriminatory
power. To acquire the training results requires

i) the extraction of the texture measures from the
training samples;

ii) measurement selection to obtain the measures that
should be used to make each class pair decision;

iii) estimation of the necessary mean vectors and covariance
matrices; and

iv) the application of the pairwise classification
procedure to the texture measures extracted from the training
samples.

The maximum number of measurements which could be used to make each class
pair decision was derived using the Foley criterion [44] with a minimum
of 8 samples in each class for each measure used. The measurement
selection method was a forward sequential search procedure [5]. It is worth
observing that 3 of the 5 measures selected in the residential/mobile home
class pair decision involved d values of 16 and 20. Note that this agrees
with the observations concerning the plots of Figure 12.
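
The bound on the number of measurements per class pair amounts to one integer division. The sketch below is an interpretation, assuming the smaller class of the pair sets the limit; the function name `max_measurements` and the sample counts in the comment are illustrative and not taken from the study.

```python
def max_measurements(n_j: int, n_k: int, samples_per_measure: int = 8) -> int:
    """Largest number of measures allowed for a class pair decision.

    n_j and n_k are the training sample counts of the two classes; the
    smaller class is assumed to set the limit.
    """
    return min(n_j, n_k) // samples_per_measure


# Illustrative counts: a pair whose smaller class has 41 samples may use
# at most 5 measures, however large the other class is.
limit = max_measurements(120, 41)
```

Because the limit is computed per pair, a class with few training samples restricts only the decisions it takes part in, which is the motivation for the pairwise formulation.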

The training results for this study are given in Table 2. The
percentage of overall correct classification obtained was 90%. This
figure was computed by dividing the total number of correctly classified
regions by the total number of training samples.
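
The overall figure is the trace of the confusion matrix divided by its grand total. A minimal sketch, in which the function name and the small 3 x 3 matrix are hypothetical and used only to show the arithmetic:

```python
def percent_correct(confusion):
    """Overall percentage correct from a square confusion matrix.

    Rows are true classes, columns are assigned classes.
    """
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return 100.0 * correct / total


# Hypothetical confusion matrix, not data from the study.
toy_confusion = [
    [45, 5, 0],
    [3, 27, 0],
    [2, 0, 18],
]
overall = percent_correct(toy_confusion)

# The same division gives a per-class figure, e.g. 99 correct out of 121.
per_class = round(100.0 * 99 / 121, 2)
```

The overall percentage weights classes by their sample counts, so a large, well-classified class can mask a poorly classified small one; the per-class figures should be read alongside it.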

A  few  observations  concerning  the  training  results  should  be  made. 

First, it should be noted that the commercial/industrial class and the dry
land class represent the two most heterogeneous classes. That is, there
are a number of visually dissimilar areas comprising each of these classes.
Consider, for example, the dry land class. This class is composed of grass
land, mixture of grass and trees, cultivated areas with prominent rows and
areas of exposed soil with little or no grass. Similarly, the
commercial/industrial class is composed of such heterogeneous possibilities
as a drive-in theater screen, wind tunnels, apartments and railroad
station/depot. The heterogeneous makeup of these classes shows itself in the
training results from the fact that members of most of the other seven
classes were misclassified into one or both of these classes.

Another point of interest about the training results is that the
majority of the incorrect classifications were graceful misses. That is,
in the majority of instances when a misclassification occurred the class
in which the sample was incorrectly placed was a good second choice for
placing that sample. Consider for example the residential samples that
were misclassified. These samples were all labeled commercial/industrial.
Residential areas look more like commercial/industrial areas than any
of the rest of the classes. Further consider the misclassified mobile
home samples. These samples were placed in the residential or
commercial/industrial classes. Again, both of these classes represent better
alternatives than any of the other classes. Also the vast majority of the
misclassified vehicle parking area samples were called commercial/industrial
area samples. Note that many of the commercial/industrial samples have large
parking areas contained in them.

                                          2    0  322
                                               41   87.80%
                                               25   72.00%
                                         10   353   91.22%
                                               85   98.82%
                                               78   89.74%
                                               21   85.71%
                                         99   121   81.82%

Total   65  341   40   19  356   85   73   18  118  1135

OVERALL PERCENTAGE CORRECT CLASSIFICATION 90.04%

RES - Residential
COM - Commercial/Industrial
MH  - Mobile Home
VPK - Vehicle Parking Area
DL  - Dry Land
WTR - Water
RW  - Runway/Taxiway
APK - Aircraft Parking
MHW - Multilane Highway

Table 2. The results obtained by classifying the
training samples. The training results
gave a 90% overall correct classification.

Also consider the confusion between the commercial/industrial class
and the dry land class. As was stated earlier both the
commercial/industrial and dry land classes are heterogeneous in structure.
Further, almost every class has samples which are misclassified into one of
these two classes. The confusion between these two classes seems to be based
on a similar structure. Many of the commercial/industrial training samples
have one or more buildings surrounded by a large grassy area. Evidently, the
methods employed have trouble discriminating areas of bare soil surrounded
by grass from buildings surrounded by grass.

Finally both the dry land class and commercial/industrial class are
confused with the multilane highway class. To explain this confusion it
should be observed that a 145 x 145 sample of multilane highway must contain
things other than a multilane highway. In the majority of the training
samples of multilane highway the other objects in the 145 x 145 areas
were commercial/industrial areas and/or dry land.

These results are encouraging. Even though some misses occur,
they indicate that texture measures can be used to characterize
complicated land use classes.

5.2 Capabilities of the Pairwise Classification Procedure

A procedure which seems appropriate for evaluating the classification
scheme is to partition the image into regions as though a segmentation were
to be performed and then classify only those regions which are uniform.
In this case the regions are created by overlaying a 145 x 145 square grid
on the image as shown in Figure 16. The resulting 1156 regions include
680 which have more than 90% of their area composed of one of the nine
classes. In the study the definition of uniformity was relaxed to assure
an adequate number of samples.
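
The relaxed uniformity rule can be sketched on a ground truth label map. This is an illustration under stated assumptions: the label map is an integer array with one class index per pixel, partial cells at the image edge are ignored, and the function name `uniform_cells` is introduced here. Note that 1156 = 34 x 34, which is consistent with a grid of 34 cells on a side, although the image dimensions are not stated in this section.

```python
import numpy as np


def uniform_cells(labels: np.ndarray, cell: int = 145, threshold: float = 0.90):
    """List (row, col, dominant_class) for grid cells passing the threshold.

    A cell counts as uniform when more than `threshold` of its pixels
    carry the same class label.
    """
    rows, cols = labels.shape[0] // cell, labels.shape[1] // cell
    result = []
    for r in range(rows):
        for c in range(cols):
            block = labels[r * cell:(r + 1) * cell, c * cell:(c + 1) * cell]
            counts = np.bincount(block.ravel())
            dominant = int(counts.argmax())
            if counts[dominant] / block.size > threshold:
                result.append((r, c, dominant))
    return result


# Small synthetic label map: class 0 on the left, class 1 from column 5 on.
demo = np.zeros((8, 8), dtype=int)
demo[:, 5:] = 1
demo_cells = uniform_cells(demo, cell=4)
```

In the synthetic example the right-hand cells are 75 percent class 1 and are therefore left out, mirroring how regions straddling a class boundary are excluded from the evaluation set.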

The results of applying the pairwise classification scheme to the 680
regions are shown in Table 3. Observe that these results are comparable
to the training results given in Table 2, with the most significant difference
being only 8% below the training results. In making this comparison it is
important to point out that seldom if ever did a training sample coincide
with one of the 680 regions considered. Further it should be observed
that three classes, namely, vehicle parking, runway/taxiway and aircraft
parking, have higher percentages of correct classification than those achieved
during training. This anomaly can be explained for the vehicle parking
and aircraft parking classes by noting in Table 3 the small number of samples
representing these two classes. To explain the discrepancy for the
runway/taxiway class, note that the positioning of the paved runway area
within the 145 x 145 sample region will affect the values of the texture
measures. Ideally one would like the runway to pass through the center
of the region. However, to adequately train the procedure to recognize
this class, samples reflecting less than ideal conditions were included.

[Table 3 body not recoverable; surviving heading: VERIFIED CLASSIFICATION]

RES - Residential
COM - Commercial/Industrial
MH  - Mobile Home
VPK - Vehicle Parking Area
DL  - Dry Land
WTR - Water
RW  - Runway/Taxiway
APK - Aircraft Parking
MHW - Multilane Highway

Table 3. The results obtained by applying the pairwise classification
scheme to 680 uniform regions of the image.

Consequently the difference in classification accuracies is attributable
to the good runway sample region alignment occurring just by chance in
this study.

Finally it should be observed that the misclassifications shown in
Table 3 are very similar to those given in Table 2, the only exception
being the five commercial/industrial samples which were misclassified into
the runway/taxiway class. To explain this, note that four of the five areas
misclassified in this manner involved large hangars. According to Table 1
hangars are a part of the commercial/industrial class. Yet the large
hangars involved in the misclassifications look nothing like the
commercial/industrial samples included in the training set for this class.
This fact, together with the visual similarity of regions containing hangars
to those containing a runway, explains the confusion. This can be observed
from Figure 17.

5.3 A Segmentation of the Scene

Practically speaking, in any automatic segmentation procedure there is
a level, say level $l$, at which a labeling of every remaining region must
be forced. Usually the percentage of the regions which are unspecified
or boundary decreases with increasing $l$, since the smaller the regions
R, the more of them that will fall into areas composed of a single class.
Admittedly there are mitigating factors, but a reasonable estimate of the
lower bound on the performance of a segmentation can be derived by forcing
a label on the regions at the lowest possible level, level 1. This follows
from the fact that the percentage of unspecified and boundary regions
will be maximum at this level. Such a lower bound will be estimated in
this section. The data used to establish this lower bound are the 1156
145 x 145 regions shown in Figure 16.


In forcing a label on these regions, a difficulty that is immediately
encountered relates to establishing ground truth. For example, it is
possible for some of these regions to contain multiple land use classes.
To avoid any possible ambiguity, rules were formulated for scoring the
segmentation. Let R be a 145 x 145 square in question. Further let
$\omega_j$ represent class $j$ and $A(\omega_j)$ represent the area of R
occupied by class $\omega_j$. Then the scoring rules are as follows:

1. If R contains n classes, say $\omega_1, \ldots, \omega_n$, and if
   $A(\omega_j) > A(\omega_k)$ for all $k$ such that $1 \le k \le n$,
   $k \ne j$, then the correct labeling of R is assumed to be $\omega_j$.

2. If R contains n classes, say $\omega_1, \ldots, \omega_n$, and if there
   are m classes, say $\omega_1, \omega_2, \ldots, \omega_m$, such that
   $A(\omega_1) = \cdots = A(\omega_m)$ and further that
   $A(\omega_j) > A(\omega_k)$, $1 \le j \le m$, $m < k \le n$, then the
   correct labeling is assumed to be any one of the $\omega_j$,
   $1 \le j \le m$.
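
The two scoring rules reduce to one statement: a forced label is counted as correct when it names a class of maximal area in R, with ties accepted. A minimal sketch of that reading follows; the function names and the dictionary representation of per-class areas are assumptions made for illustration and do not come from the study.

```python
def acceptable_labels(areas):
    """Return the set of labels counted as correct for a region R.

    areas : dict mapping class name -> area of R occupied by that class
            (for example a pixel count). Classes with zero area are ignored.
    Rule 1: a single class with strictly largest area is the only correct label.
    Rule 2: if several classes tie for the largest area, any of them is correct.
    """
    present = {name: area for name, area in areas.items() if area > 0}
    if not present:
        raise ValueError("region contains no classes")
    largest = max(present.values())
    return {name for name, area in present.items() if area == largest}

def is_correct(assigned_label, areas):
    """Score one forced label against the verified class areas of R."""
    return assigned_label in acceptable_labels(areas)
```

For a 145 x 145 square (21025 pixels) holding 15000 pixels of residential and 6025 of commercial/industrial, only the residential label scores as correct; if dry land and water each occupied 10000 pixels, either label would be accepted.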

The results obtained by using the pairwise classification procedure to
label all 1156 regions are given in Table 4. In examining these results it
is again important to note that seldom if ever did a training sample
coincide with one of the regions shown in Figure 16. The overall
percentage of correct classification is about 83%.

As will be observed, six of the nine classes have good classification
accuracies, ones which compare favorably to those obtained when only uniform
regions are considered (Table 3). However, three of the classes, namely,
mobile home, vehicle parking and aircraft parking, do not. To explain
these low classification accuracies it is important to note that when only
uniform regions are considered these classes have high probabilities of
correct classification. Given that each of these classes composes only a
small percentage of the whole scene, the obvious explanation for these poor
results is that most of the regions misclassified were actually boundary
regions. An examination of the data confirms this explanation.

[Table 4 body not recoverable; surviving heading: VERIFIED CLASSIFICATION]

RES - Residential
COM - Commercial/Industrial
MH  - Mobile Home
VPK - Vehicle Parking Area
DL  - Dry Land
WTR - Water
RW  - Runway/Taxiway
APK - Aircraft Parking
MHW - Multilane Highway

Table 4. The results obtained when the image was segmented. It should be
noted that few if any of the regions used in the segmentation exactly
corresponded to the regions comprising the training set.

The percentage of correct classification of the other classes seemed
acceptable. As will be observed, the algorithm failed gracefully on a
majority of the misclassified samples. Consider the residential class.
The majority of the misclassified residential samples were labeled as
commercial/industrial. Clearly if a residential area is to be missed,
the commercial/industrial class is the one which most closely resembles the
residential class.

The vast majority of the misclassified commercial/industrial samples
were called either residential, dry land or multilane highway. Similarly, the
vast majority of the misclassified dry land samples were called
commercial/industrial.

This leads one to the water class. As will be observed from Table 4,
some water samples are mislabeled as commercial/industrial, dry land,
runway and aircraft parking. These are clearly very bad misses. However,
there is a mitigating factor. All the water samples which were
misclassified into either the commercial/industrial class, the runway/taxiway
class, or the aircraft parking class contained a levee, i.e., an
unspecified class, since none of the training samples for any of the classes
contained a levee. Finally, some of the confusion between the water class
and the dry land class seems to be caused by mixtures of dry land and water.
Water samples which contained only reasonably small areas of dry land
were consistently called dry land by the algorithm.

The runway class was very accurately classified, with only two of the
68 samples being misclassified as dry land. The multilane highway class
had samples misclassified as commercial/industrial, residential and mobile
home. It is interesting to note that each of the multilane highway training
samples contained areas from these three classes and dry land. However, only
the commercial/industrial, residential and mobile home areas contained
structures which could markedly affect the texture measures' ability to detect
the presence of a major road.


6.  EVALUATING  THE  SPLITTING  CRITERIA 

Recall that the splitting criteria involve two tests, the Chi-squared
test and the ratioing test. The Chi-squared test was formulated primarily to
differentiate uniform regions from unspecified regions, whereas the ratioing
test was formulated primarily to differentiate uniform regions from boundary
regions. In this section studies aimed at evaluating these tests will be
briefly described.

6.1 Evaluating the Capabilities of the Chi-Squared Test

The data used to evaluate the Chi-squared test were taken from a square
area in the extreme upper right hand corner of the image. This area
contained 144 regions and comprised the last 12 elements of the first
12 rows of the grid given in Figure 16. This area was selected because
it contained a levee and a land/water boundary. The levee is of interest
because, while being a part of the dry land class, any 145 x 145 region
containing a levee must necessarily contain water. Also the training set
for the dry land class contained no levee samples. Hence the
segmentation procedure should call such regions unspecified. The presence
of the land/water boundary could be used to determine whether the Chi-squared
test could be used to find boundary regions.

The results of applying a simplified version of the Chi-squared test
to the 144 regions are summarized in Table 5. The first row of this
table shows how the 53 verified unspecified regions were handled by
the segmentation procedure. Observe that 44 (83%) of these were correctly
identified as being unspecified by the Chi-squared test. Of the 9
regions which were mistakenly judged as being uniform, 7 were
correctly classified by the pairwise classifier.


[Table 5 body not recoverable; surviving heading: COMPUTER RESULTS]

Table 5: A summary of the results obtained by applying the simplified
form of the Chi-squared test to the 144 regions contained in
a square area in the extreme upper right hand side of the scene.


The second row of Table 5 shows how the verified boundary regions
were handled. Observe that the Chi-squared test correctly identified 11 (37%)
of the 30 regions as boundary regions. Of the 19 regions which were not
identified as boundary regions by the Chi-squared test, 17 (89%)
were labeled by the pairwise classifier as belonging to one of their major
constituent classes. Therefore, only 2 out of 30 samples can be viewed
as incorrectly labeled. It should be noted that the Chi-squared test is
specifically derived to differentiate unspecified regions from uniform
regions, and the correct differentiation of boundary regions from uniform
regions is really a secondary consideration for this test.
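
The percentages quoted for the first two rows of Table 5 follow directly from the counts given in the text. The short check below reproduces that arithmetic; the variable names are chosen here for illustration, and only the counts stated in the text are used.

```python
def percent(part, whole):
    """Percentage of `part` in `whole`, rounded to the nearest integer."""
    return round(100 * part / whole)

# First row: verified unspecified regions.
unspecified_total = 53
unspecified_detected = 44
unspecified_missed = unspecified_total - unspecified_detected

# Second row: verified boundary regions.
boundary_total = 30
boundary_detected = 11
boundary_missed = boundary_total - boundary_detected
boundary_missed_but_acceptable = 17   # labeled as a major constituent class
boundary_wrong = boundary_missed - boundary_missed_but_acceptable

print(percent(unspecified_detected, unspecified_total))          # share of unspecified regions detected
print(percent(boundary_detected, boundary_total))                # share of boundary regions detected
print(percent(boundary_missed_but_acceptable, boundary_missed))  # share of misses with an acceptable label
print(boundary_wrong)                                            # boundary regions left incorrectly labeled
```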

The third row corresponds to those verified uniform regions which were
correctly labeled as being a member of one of K classes by the Chi-squared
test. The fourth row corresponds to those verified uniform regions which
were incorrectly labeled by the Chi-squared test. Observe that only 6 (10%)
of the correctly labeled uniform regions were called unspecified by the test.

6.2 Evaluating the Capabilities of the Ratioing Test

The results acquired by applying the simplified version of the ratioing
test to the 102 regions are given in Table 6. Also shown are the results
of applying the Chi-squared test with $\alpha = .95$ and the pairwise
classification procedure to these same 102 samples. The first row shows how
the 65 verified boundary regions were handled. Observe that the
ratioing test alone could identify 27 of these regions. In all, the
ratioing test identified 40 of the 65 regions. The Chi-squared test
aided by detecting 10 more. Thus in combination the two tests detected
77% of boundary regions.
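
The combined detection rate can be reproduced from the counts above. The sketch below assumes that the 10 regions credited to the Chi-squared test are in addition to the 40 found by the ratioing test, which is how the text reads; the function name is hypothetical.

```python
def combined_detection(total, ratioing_detected, chi_squared_additional):
    """Combine two detectors whose counts do not overlap.

    total                  : number of verified boundary regions
    ratioing_detected      : regions flagged by the ratioing test
    chi_squared_additional : further regions flagged only by the Chi-squared test
    Returns (detected, missed, percent_detected).
    """
    detected = ratioing_detected + chi_squared_additional
    missed = total - detected
    return detected, missed, round(100 * detected / total)

detected, missed, rate = combined_detection(65, 40, 10)
print(detected, missed, rate)
```

Under this reading 50 of the 65 boundary regions are detected and 15 are missed, which rounds to the 77% quoted above.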

The second and third rows show how verified uniform regions, both
those which were correctly and those which were incorrectly classified by
the pairwise classifier, were treated. Observe that the ratioing test
mistakenly called 7 (24%) of these regions boundary regions. Further, the
Chi-squared test incorrectly labeled an additional 11 (38%). The poor
performance of the Chi-squared test in this case is probably a result of a
suboptimal choice of $\alpha$.

[Table 6 body not recoverable; surviving heading: COMPUTER RESULTS]

Table 6: A summary of the results obtained by applying the simplified forms
of the ratioing test and the Chi-squared test to 102 regions.
Note that the table also indicates the classification results
obtained by the pairwise classifier for the 102 regions.

On the third row observe that, of the 8 uniform regions incorrectly
labeled by the classifier, 6 (75%) were identified as boundary regions.


7. SUMMARY AND CONCLUSIONS


This paper described a study aimed at segmenting a high resolution
urban scene. To accomplish this objective a statistical segmentation
procedure was developed, one whose primitive operators were SGLDM texture
measures. The procedure is an early vision system based on a split type
algorithm. It considers three types of regions at each level of the
segmentation: uniform, boundary and unspecified. The procedure is based
on three hypothesis tests. Experiments were performed which indicate
the utility of the methodologies employed. In particular, a training
result of 90% overall correct classification for the nine classes considered
confirms the ability of the texture algorithm to characterize land use
classes. Further, an approximate lower bound of 83% correct classification
was established on the performance of the segmentation procedure on
independent testing samples. Finally, the performance of the tests associated
with the decision to split a region was evaluated. The results show
that these tests performed well in making a split decision. The results
obtained substantiate the use of texture operators to segment complicated
urban scenes.
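
For readers unfamiliar with SGLDM, the sketch below shows the general idea of a spatial gray level dependence (co-occurrence) matrix and one measure commonly derived from it. It is an illustrative assumption, not the study's code: the displacement, the number of gray levels, the function names, and the choice of the contrast measure are all selected here for the example and are not taken from this paper.

```python
import numpy as np

def cooccurrence(image, dx=1, dy=0, levels=8):
    """Symmetric, normalized co-occurrence matrix for displacement (dx, dy).

    image  : 2-D array of integer gray levels in the range 0 .. levels-1
    Entry (i, j) estimates the probability that two pixels separated by the
    displacement have gray levels i and j (in either order).
    """
    img = np.asarray(image)
    h, w = img.shape
    # Keep only pixel pairs for which both members lie inside the image.
    r0, r1 = max(0, -dy), min(h, h - dy)
    c0, c1 = max(0, -dx), min(w, w - dx)
    first = img[r0:r1, c0:c1]
    second = img[r0 + dy:r1 + dy, c0 + dx:c1 + dx]
    counts = np.zeros((levels, levels))
    np.add.at(counts, (first.ravel(), second.ravel()), 1)  # accumulate pair counts
    counts = counts + counts.T                              # make the matrix symmetric
    return counts / counts.sum()

def contrast(p):
    """Contrast measure: sum over (i, j) of (i - j)^2 * p[i, j]."""
    i, j = np.indices(p.shape)
    return float(((i - j) ** 2 * p).sum())
```

A region of constant gray level gives a contrast of zero, while a region with frequent gray level changes along the chosen displacement gives a larger value; measures of this kind, computed per region, are what a texture-based classifier would consume.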

As stated previously, the segmentation procedure was not completely
implemented in that multiple levels could not be considered. A further
study is required to consider this case. Also, methods to incorporate
more world knowledge need to be investigated. Finally, methods for improving
the measurement selection algorithm should be studied, since the algorithm
used considers only the ability of a measure to correctly classify
uniform regions.